Picture for Huchuan Lu

Huchuan Lu

VISTA-Bench: Do Vision-Language Models Really Understand Visualized Text as Well as Pure Text?

Add code
Feb 04, 2026
Viaarxiv icon

Interactive Spatial-Frequency Fusion Mamba for Multi-Modal Image Fusion

Add code
Feb 04, 2026
Viaarxiv icon

Think3D: Thinking with Space for Spatial Reasoning

Add code
Jan 19, 2026
Viaarxiv icon

Towards Cross-Platform Generalization: Domain Adaptive 3D Detection with Augmentation and Pseudo-Labeling

Add code
Jan 13, 2026
Viaarxiv icon

The RoboSense Challenge: Sense Anything, Navigate Anywhere, Adapt Across Platforms

Add code
Jan 08, 2026
Viaarxiv icon

AR-MOT: Autoregressive Multi-object Tracking

Add code
Jan 05, 2026
Viaarxiv icon

Utilizing Earth Foundation Models to Enhance the Simulation Performance of Hydrological Models with AlphaEarth Embeddings

Add code
Jan 04, 2026
Viaarxiv icon

Living the Novel: A System for Generating Self-Training Timeline-Aware Conversational Agents from Novels

Add code
Dec 08, 2025
Figure 1 for Living the Novel: A System for Generating Self-Training Timeline-Aware Conversational Agents from Novels
Figure 2 for Living the Novel: A System for Generating Self-Training Timeline-Aware Conversational Agents from Novels
Figure 3 for Living the Novel: A System for Generating Self-Training Timeline-Aware Conversational Agents from Novels
Figure 4 for Living the Novel: A System for Generating Self-Training Timeline-Aware Conversational Agents from Novels
Viaarxiv icon

Parameter Aware Mamba Model for Multi-task Dense Prediction

Add code
Nov 18, 2025
Figure 1 for Parameter Aware Mamba Model for Multi-task Dense Prediction
Figure 2 for Parameter Aware Mamba Model for Multi-task Dense Prediction
Figure 3 for Parameter Aware Mamba Model for Multi-task Dense Prediction
Figure 4 for Parameter Aware Mamba Model for Multi-task Dense Prediction
Viaarxiv icon

Spatial-Frequency Enhanced Mamba for Multi-Modal Image Fusion

Add code
Nov 10, 2025
Viaarxiv icon